A  Superlinearly  Convergent  Polynomial 
Primal-Dual  Interior-Point  Algorithm 
for  Linear  Programming 

Yin  Zhang 
and 

R.A.  Tapia 
June,  1990 

(Revised  December,  1991) 


TR90-40 


Report  Documentation  Page 

Form  Approved 

OMB  No.  0704-0188 

Public  reporting  burden  for  the  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and 
maintaining  the  data  needed,  and  completing  and  reviewing  the  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  information, 
including  suggestions  for  reducing  this  burden,  to  Washington  Headquarters  Services,  Directorate  for  Information  Operations  and  Reports,  1215  Jefferson  Davis  Highway,  Suite  1204,  Arlington 

VA  22202-4302.  Respondents  should  be  aware  that  notwithstanding  any  other  provision  of  law,  no  person  shall  be  subject  to  a  penalty  for  failing  to  comply  with  a  collection  of  information  if  it 
does  not  display  a  currently  valid  OMB  control  number. 

1.  REPORT  DATE 

DEC  1991  2- REPORT  TYPE 

3.  DATES  COVERED 

00-00-1991  to  00-00-1991 

4.  TITLE  AND  SUBTITLE 

A  Superlinearly  Convergent  Polynomial  Primal-Dual  Interior-Point 
Algorithm  for  Linear  Programming 

5a.  CONTRACT  NUMBER 

5b.  GRANT  NUMBER 

5c.  PROGRAM  ELEMENT  NUMBER 

6.  AUTHOR(S) 

5d.  PROJECT  NUMBER 

5e.  TASK  NUMBER 

5f.  WORK  UNIT  NUMBER 

7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESS(ES) 

Computational  and  Applied  Mathematics  Department  ,Rice 

University, 6100  Main  Street  MS  134, Houston, TX, 77005-1892 

8.  PERFORMING  ORGANIZATION 

REPORT  NUMBER 

9.  SPONSORING/MONITORING  AGENCY  NAME(S)  AND  ADDRESS(ES) 

10.  SPONSOR/MONITOR'S  ACRONYM(S) 

11.  SPONSOR/MONITOR'S  REPORT 
NUMBER(S) 

12.  DISTRIBUTION/AVAILABILITY  STATEMENT 

Approved  for  public  release;  distribution  unlimited 

13.  SUPPLEMENTARY  NOTES 

14.  ABSTRACT 

15.  SUBJECT  TERMS 

16.  SECURITY  CLASSIFICATION  OF:  17.  LIMITATION  OF 

18.  NUMBER  19a.  NAME  OF 

a.  REPORT  b.  ABSTRACT  c.  THIS  PAGE 

unclassified  unclassified  unclassified 

31 

Standard  Form  298  (Rev.  8-98) 

Prescribed  by  ANSI  Std  Z39-18 


A  Superlinearly  Convergent  Polynomial 
Primal-Dual  Interior-Point  Algorithm 
for  Linear  Programming 

Yin  Zhang  *and  Richard  A.  Tapia  t 
June,  1990 

(Revised  December,  1991) 


Key  Words:  Linear  programming,  Primal-dual  interior-point  algorithms,  Polyno¬ 
miality,  Quadratic  and  superlinear  convergence. 

Abbreviated  Title:  A  Superlinearly  Convergent  Polynomial  Algorithm 


‘Department  of  Mathematics  and  Statistics,  University  of  Maryland,  Baltimore  County  Campus, 
Baltimore,  MD  21228,  and  visiting  member  of  the  Center  for  Research  in  Parallel  Computation,  Rice 
University.  This  author  was  supported  in  part  by  NSF  Coop.  Agr.  No.  CCR-8809615,  NSF  DMS- 
9102761  and  DOE  DE-FG05-91ER25100. 

t  Department  of  Mathematical  Sciences  and  the  Center  for  Research  in  Parallel  Computation,  Rice 
University,  Houston,  Texas  77251-1892.  This  author  was  supported  in  part  by  NSF  Coop.  Agr.  No. 
CCR-8809615,  AFOSR  89-0363,  DOE  DEFG05-86ER25017  and  ARO  9DAAL03-90-G-0093. 


1 


Abstract 


The  choice  of  the  centering  parameter  and  the  step-length  parameter  are  the 
fundamental  issues  in  primal-dual  interior-point  algorithms  for  linear  programming. 
Various  choices  for  these  two  parameters  have  been  proposed  that  lead  to  polyno¬ 
mial  algorithms.  Recently,  Zhang,  Tapia  and  Dennis  derived  conditions  for  the 
choices  of  the  two  parameters  that  were  sufficient  for  superlinear  or  quadratic  con¬ 
vergence.  However,  prior  to  this  work  it  had  not  been  shown  that  these  conditions 
for  fast  convergence  are  compatible  with  the  choices  that  lead  to  polynomiality; 
none  of  the  existing  polynomial  primal-dual  interior-point  algorithms  satisfies  these 
fast  convergence  requirements.  This  paper  gives  an  affirmative  answer  to  the  ques¬ 
tion:  can  a  primal-dual  algorithm  be  both  polynomial  and  superlinearly  convergent 
for  general  problems?  We  construct  and  analyze  a  “large  step”  algorithm  that  pos¬ 
sesses  both  polynomiality  and,  under  the  assumption  of  the  convergence  of  the 
iteration  sequence,  (Q-superlinear  convergence.  For  nondegenerate  problems,  the 
convergence  is  actually  Q-quadratic. 


1  Introduction 


We  consider  linear  programs  in  the  standard  form: 

minimize  cTx 

subject  to  Ax  —  b,  (1.1) 

x  >  0, 


where  c,  x  £  Rn,  b  £  Rm,  A  £  RmXn(m  <  n)  and  A  is  assumed  to  have  full  rank  m. 
The  first-order  optimality  conditions  for  (1.1)  can  be  written 


Ax  —  b 
AtX  +  y  —  c 
XYe 


=  0,  (x,  y)  >  0, 


(1.2) 
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where  A  and  y  are  dual  variables,  X  =  diag(x),  Y  =  diag(y)  and  e  has  all  components 
equal  to  one.  To  facilitate  our  presentation,  we  will  eliminate  the  dual  variable  A  from 
the  above  system  (though  such  an  elimination  may  not  be  advisable  from  a  practical 
point  of  view).  Let  B  £  R.(n-m)Xn  be  any  matrix  such  that  the  columns  of  BT  form  a 
basis  for  the  null  space  of  A.  Pre-multiply  the  second  equation  by  the  nonsingular  matrix 
[AT  Bt]t.  Notice  that  BAT  =  0,  so 


0  = 


A 

B 


(AT  A  +  y-c) 


(  AAT\  +  A(y  ~c)\ 
^  By  —  Be 


Since  AAT  is  nonsingular,  A  is  uniquely  determined  once  y  is  known.  Removing  the 
equation  for  A,  we  arrive  at  the  following  2 n  by  2 n  nonlinear  system  with  non-negativity 
constraints  on  the  variables: 


F(x,y) 


(  Ax  —  b  ^ 
By  —  Be 
XYe 


=  0,  (*,»)>  0. 


(1.3) 


By  the  feasibility  set  of  problem  (1.3)  we  mean: 


ft  =  {(s,  y)-x,ye  R",  Ax  =  b,By  =  Be ,  (x,  y)  >  0}. 

A  feasible  pair  (x,y)  €  ft  is  said  to  be  strictly  feasible  if  it  is  positive.  In  this  work  we 
tacitly  assume  that  strictly  feasible  points  exist. 

It  is  easy  to  see  that  for  ( x,y )  £  0,  ||F(x,y)||i  =  xTy  which  can  be  shown  to  be  the 
duality  gap  for  problem  (1.1);  we  will  use  the  duality  gap  as  the  merit  function  for  our 
algorithm,  i.e.,  the  criterion  that  tells  us  when  one  feasible  point  should  be  preferred  to 
another. 

Mathematically  speaking,  the  concepts  of  polynomiality  and  rate  of  convergence  are 
incompatible.  Polynomiality  is  meaningful  only  for  algorithms  that  converge  in  a  finite 
number  of  steps,  while  rate  of  convergence  is  defined  only  for  algorithms  that  take  an 
infinite  number  of  steps  to  converge.  When  we  say  that  an  interior-point  algorithm  is 
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polynomial,  we  have  in  mind  integral  (or  rational)  data  and  finite  termination.  On  the 
other  hand,  when  we  say  the  same  algorithm  is  linearly  convergent,  for  example,  we  do 
so  in  the  traditional  numerical  analysis  sense.  With  this  understanding,  we  can  discuss 
polynomiality  and  rate  of  convergence  of  an  algorithm  at  the  same  time. 

It  is  clearly  desirable  to  develop  algorithms  that  possess  both  polynomiality  and  fast 
asymptotic  convergence,  or  in  other  words,  both  good  global  behavior  and  good  local 
behavior.  To  our  knowledge,  the  only  prior  work  in  this  direction  was  Yamashita  [10]. 
Based  on  the  multiplicative  penalty  function  of  Iri  and  Imai  [2],  Yamashita  constructed 
a  polynomial  primal  algorithm  and  demonstrated  its  quadratic  convergence  under  the 
following  two  assumptions:  (i)  the  optimal  objective  value  is  known,  and  (ii)  the  iteration 
sequence  converges  to  a  nondegenerate  optimal  vertex.  The  first  assumption  is  in  general 
not  realistic.  The  second  assumption  is  very  restrictive  because  most  practical  problems 
are  degenerate. 

The  objective  of  this  work  is  to  construct  a  primal-dual  interior-point  algorithm  for 
problem  (1.1)  that  possesses  both  polynomiality  and  fast  convergence  under  more  realistic 
and  less  restrictive  assumptions.  We  construct  such  an  algorithm  and  show  that  it  takes 
at  most  0(nL )  iterations  to  reduce  the  duality  gap  to  2~L.  Moreover,  we  demonstrate 
that  this  algorithm  gives  quadratic  convergence  for  nondegenerate  problems  and  gives 
Q-superlinear  convergence  for  degenerate  problems. 

Subscripts  will  be  used  to  distinguish  values  of  quantities  at  a  particular  iteration  and 
superscripts  will  indicate  components  of  vectors.  We  also  use  the  notation: 

min(u)  =  min  v'  and  max(u)  =  max  v' 

1  <i<n  v  ’  1  <i<n 

for  a  vector  v  €  Rn.  The  symbol  ||  •  ||  denotes  the  1 2  norm  unless  otherwise  stated.  We  will 
use  the  standard  big-0  notation  in  this  paper;  in  particular,  for  a  sequence  {T*,}  C  R" 
and  a  positive  sequence  {ajt}  C  R,  Vk  =  O(ak)  implies  the  existence  of  positive  constants 
/?  and  ko  such  that  ||u*||  <  /3ak  for  all  k  >  fc0- 

The  paper  is  organized  as  follows.  In  Section  2,  we  describe  a  general  interior-point 
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algorithmic  framework  for  problem  (1.1)  based  on  the  nonlinear  system  (1.3)  and  give  a 
brief  survey  of  existing  results  for  algorithms  that  fall  into  this  framework.  In  Sections  3 
and  4,  we  specify  our  procedures  for  determining  the  step  length  and  for  choosing  the 
centering  parameter.  We  state  our  algorithm  in  Section  5.  Global  linear  convergence  (and 
polynomiality)  is  established  in  Section  6.  Quadratic  convergence  for  nondegenerate 
problems  is  established  in  Section  7,  and  superlinear  convergence  for  all  problems  is 
established  in  Section  8.  Concluding  remarks  are  given  in  Section  9. 

2  General  Algorithm 

Algorithm  1  (General  Algorithm) 

Given  a  strictly  feasible  pair  ( x0 ,  yo).  For  k  =  0, 1, 2, . . .,  do 

Step  1  Compute  the  Newton  step 

=  -[F'(xk,yk)]-lF(xk,yk) 

and  the  centering  step 

/  Axc  \  (  0  \ 

* 

/  \e ) 

Step  2  Choose  crk  €  (0, 1)  and  form  the  combined  step 

(  Axk\  (  Ax"  \  (  Axck\ 

—  +  &k 

\AykJ  ^  Ayjf  y  ^  A y% 

Step  3  Choose  rk  £  (0, 1)  and  set  ak  =  rkak,  where 

.  _ _ -1 _ 

ak  min  (XkxAxk,YClAyk)' 


'  Ax"k 

V  AVk 
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Step  4  Compute  the  new  iterate 


(  %k+ 1 

M 

1  N 

+  Oik 

\  yk+ i  y 

V  »  / 

^  Ayk  ) 

We  will  now  briefly  comment  on  this  general  algorithmic  framework.  From  a  direct 
calculation,  we  have 


F'(x,y) 


A  0 
0  B 
Y  X 


(2.1) 


Since  we  have  assumed  that  A  has  full  rank,  it  is  a  straightforward  matter  to  verify 
that  F'(x,y)  is  nonsingular  for  any  positive  pair  (x,y).  In  addition,  relation  (2.5)  below 
guarantees  that  >  0.  Hence  the  iterates  produced  by  Algorithm  1  are  well-defined. 
Notice  that  the  restriction  oik  <  otk  guarantees  that  the  iterates  remain  strictly  feasible. 
Moreover,  we  have  the  following  useful  relationships: 


YkAx^  +  XkAy ”  =  -XkYke, 

(2.2) 

YkAxck  +  XkAy°  =  \xTkyke, 

(2.3) 

Yk  Axk  +  XkAyk  -  -XkYke  +  ak^xlyke, 

(2.4) 

AxTkAyk  =  0, 

(2.5) 

Zjc+iJ/fc+i  =  xlyk{  1  -  (1  -  crk)ak). 

(2.6) 

We  have  stated  Algorithm  1  in  this  form  for  notational  convenience.  It  is  not  difficult 
to  verify  that  identical  iterates  {(xk,yk)}  can  be  generated  using  (1.2)  instead  of  (1.3). 
For  this  case,  there  is  no  need  for  introducing  the  matrix  B  (see  [11]  for  example). 

From  (2.6)  we  see  that  Algorithm  1  is  a  descent  algorithm  for  the  duality  gap 
||F(x,y)||i  =  xTy.  Moreover,  the  duality  gap  is  reduced  at  iteration  fc  by  a  factor 
1  —  ak(l  —  <7k)  <  1;  thus,  linear  convergence  will  be  obtained  if  {o^}  is  bounded  away 
from  zero  and  {crk}  is  bounded  away  from  one.  In  addition,  Q-superlinear  convergence 
will  be  obtained  if  0:^(1  —  cr^)  -4  1.  Observe  that  we  have  direct  control  over  the  choice  of 
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a*..  However,  we  do  not  have  the  freedom  of  choosing  ak  uniformly  bounded  away  from 
zero,  since  we  must  enforce  the  requirement  ak  <  ak  and  a k  is  not  directly  under  our 
control. 

A  number  of  existing  primal-dual  algorithms  fit  into  the  above  general  algorithmic 
framework  with  different  choices  for  the  parameters  a k  and  ak.  For  example,  in  the 
primal-dual  algorithm  of  Kojima,  Mizuno  and  Yoshise  [3],  ak  is  a  constant  and  ak  is 
a  particular  function  of  a k.  They  showed  that  their  algorithm  requires  at  most  0(nL ) 
iterations  to  reduce  the  duality  gap  by  a  factor  of  2~L .  Other  examples  include  the  Todd 
and  Ye  [9]  primal-dual  potential  reduction  algorithm  and  the  Monteiro  and  Adler  [7] 
path-following  primal-dual  algorithm.  Todd  and  Ye’s  algorithm  uses  the  choice 


where  v  is  a  constant.  In  Monteiro  and  Adler’s  algorithm, 

<*k  =  1 - 7= 

Vn 

where  6  is  a  constant  (Monteiro  and  Adler  actually  used  6  =  0.35  in  their  analysis).  In 
both  algorithms,  a  rather  short  step-length  ak  is  required.  Furthermore,  both  of  these 
algorithms  require  at  most  0(^/nL)  iterations  to  reduce  the  duality  gap  to  2~h .  This 
is  the  best  complexity  bound  obtained  for  linear  programming  so  far.  Observe  that  all 
three  algorithms  use  constant  ak.  In  each  of  the  three  cases  if  a  denotes  the  constant 
value  of  <7fc,  then  Q-superlinear  convergence  is  possible  (see  (2.6))  only  if 
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which  seems  extremely  unlikely. 

In  analyzing  the  convergence  of  Algorithm  1,  a  central  quantity  is 

_  xlyk/n 
Vk  mm(ATYfce)' 


(2.7) 
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Since  \xfyk  is  the  average  value  of  the  components  of  XkYke,  it  is  clear  that  qk  >  1.  In 
all  the  above  mentioned  polynomial  algorithms,  it  is  essential  that  the  sequence  { yk }  be 
bounded. 

Recently,  Zhang,  Tapia  and  Dennis  [11]  showed  that  under  appropriate  assumptions, 
Algorithm  1  has  fast  convergence.  The  following  two  theorems  summarize  their  main 
results.  By  a  nondegenerate  vertex  of  (1.1),  we  mean  a  feasible  point  of  (1.1)  that 
has  exactly  m  positive  components  and  the  corresponding  m  columns  of  A  are  linearly 
independent. 

Theorem  2.1  (Zhang,  Tapia  and  Dennis) 

Let  (x*,y*)  be  a  solution  of  problem  (1.3)  and  {(xk,yk)}  be  generated  by  Algorithm  1. 
Assume 

(i)  strict  complementarity  holds  at  (x*,y*), 

(ii)  x*  is  a  nondegenerate  vertex  of  (1.1), 

(iii)  °k  =  0(xfyk)  and  rk  =  1  -  0{xTkyk). 

If  {(xfc,  yk)}  converges  to  (a:*,y,);  then  the  convergence  is  Q-quadratic. 

Theorem  2.2  (Zhang,  Tapia  and  Dennis) 

Let  (a:*,?/,)  he  a  solution  of  problem  ( 1.3 )  and  {(xk,yk)}  be  generated  by  Algorithm  1. 
Assume 

(i)  strict  complementarity  holds  at  (a:*,?/*), 

(ii)  the  sequence  {r/fc}  is  bounded, 

(iii)  G k  — ►  0  and  rk  — >  1. 

If  {(xk,  yk)}  converges  to  (a;*,?/*),  then  the  duality  gap  sequence  {xf.yk}  converges  to  zero 
Q-superlinearly. 
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With  some  additional  work,  one  can  actually  demonstrate  that  the  sequence  {A^T^e} 
component-wise  converges  to  zero  Q-superlinearly. 

Several  assumptions  have  been  made  in  the  above  theorems.  Our  numerical  experi¬ 
ments  have  led  us  to  believe  that  the  strict  complementarity  assumption  is  not  restrictive. 
On  the  other  hand,  the  nondegeneracy  assumption  is  quite  restrictive  since  degeneracy 
exists  in  most  real-world  problems.  For  degenerate  solutions,  the  best  convergence  that 
has  been  established  is  Q-superlinear,  as  stated  in  Theorem  2.2. 

Although  many  of  the  existing  polynomial  primal-dual  interior-point  algorithms  sat¬ 
isfy  assumption  (ii)  of  Theorem  2.2,  none  of  them  satisfy  assumption  (iii),  i.e.,  ak  — >  0 
and  Tfc  — >  1.  In  fact,  in  several  polynomial  algorithms,  for  example  Todd  and  Ye’s  and 
Monteiro  and  Adler’s,  the  values  of  are  close  to  one.  From  Zhang,  Tapia  and  Den¬ 
nis  [11]  it  follows  that  these  algorithms  will  most  likely  have  slow  Q-linear  convergence. 
Hence  while  their  global  behavior  may  be  excellent,  their  local  behavior  can  be  improved. 

Recently,  in  a  number  of  performance-oriented  primal-dual  algorithms,  for  example 
the  ones  implemented  by  Choi  et.  al.  [1],  McShane  et.  al.  [6]  and  Lustig  et.  al.  [5], 
very  small  values  of  <7k  were  used  and  also  long  steps  were  taken.  Impressive  numerical 
results  were  obtained  from  these  implementations  though  polynomial  complexity  bounds 
are  not  known.  Hence  while  their  local  behavior  may  be  good,  their  global  behavior  is 
in  question  from  a  theoretical  standpoint. 

In  this  work,  we  develop  a  primal-dual  interior-point  polynomial  algorithm  that  gives 
quadratic  convergence  for  nondegenerate  solutions  and  superlinear  convergence  for  de¬ 
generate  solutions.  Hence,  from  a  mathematical  point  of  view,  both  the  global  and  the 
local  behavior  will  be  good.  This  new  algorithm  is  still  of  a  theoretical  nature.  However, 
the  fact  that  polynomiality  and  quadratic  or  superlinear  convergence  can  be  achieved 
simultaneously  by  one  algorithm  provides  motivation  for  practical  implementations  of 
the  conditions  ak  =  0(xlyk )  and  Tk  =  1  —  0{xkyk)  for  fast  convergence. 
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3  Determining  the  Step-Length 


In  the  previous  section  we  mentioned  that  both  polynomiality  and  superlinear  conver¬ 
gence  essentially  require  that  the  sequence  {??&}  be  bounded.  The  most  straightforward 
way  of  accomplishing  this  objective  is  to  explicitly  enforce  a  uniform  bound  on  the  quan¬ 
tity 

_  xk+iVk+i/n 
k+1  min(  Xfc+i  54+1  e) 

during  the  process  of  choosing  the  step-length  ak]  i.e.,  ask  that 


1  min(Xfc+1Ffc+1e) 

-  —  - -  /y 

Vk- (-1  xk+lVk+l/n 


for  some  7  >  0. 

Following  the  notation  used  in  [3],  let 


(3.1) 


=  xk  +  aAxk,  yk(a )  =  yk  +  aAyk, 

Ma)  =  Xk(a)Yk(a)e ,  fkve(a)  =  ±xk(a)Tyk(a),  (3.2) 

ffn(a)  =  min  (/*(«)),  /fax(a)  =  max(/*(a)). 

Note  that  the  above  quantities  actually  also  depend  on  the  centering  parameter  a  because 
both  Axk  and  Ayk  are  functions  of  a  (see  Step  3  of  Algorithm  1).  However,  since  we  will 
always  choose  a  before  we  determine  a,  it  will  suffice  to  consider  these  quantities  only 
as  functions  of  a  for  a  fixed  value  of  a. 

Whenever  a  —  0,  we  will  drop  the  argument  from  the  above  functions.  For  example, 

xk  =  ®fc(0),  /£ve  =  fkve( 0)  and  so  on.  From  the  formula  for  the  iterates  (Step  4  of 

Algorithm  1),  we  also  have  xk+1  =  xk(ak),  /££j  =  fkve(ak)  and  so  on. 

Using  the  above  notation,  we  choose  the  form  of  condition  (3.1)  as  requiring  ak  to 
satisfy 

00,  (3.3) 

where 

a.  e  hjri/n  0  <  7  <  /rv/r  <  1  and  7,Tfc  <  1.  (3.4) 
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In  the  case  1/rjo  >  7,  we  allow  1  /r/k  to  decrease  monotonically  as  long  as  1  /rjk  >7. 

In  the  following  development,  we  will  use  some  of  the  techniques  developed  by  Kojima 
et  al  in  [3]. 

Using  (3.2),  (2.4),  (2.5),  (2.6)  and  letting 

sk  =  diag(Axk)Ayk, 

we  have 

/1(a)  =  fi  -  (fi  -  <nfD*  +  4a2  (3.5) 

and 

/r»  =  /n  l-u -**)<*].  (3.6) 

Hence,  fk(a)  is  a  quadratic  (so  fkun(a)  and  /™ax(a)  are  piecewise  quadratic)  and  fkve(a) 
is  linear.  Clearly,  if  f™e(ak)  =  0,  then  (xk(ak),  yk(ak))  will  solve  problem  (1.3).  In  the 
sequel,  we  always  assume  fkve(ctk)  >  0. 

For  notational  convenience,  let  us  introduce  the  piecewise  quadratic  function 

h{a)  «  _  lkfr(a).  (3.7) 

It  follows  that  condition  (3.3)  is  equivalent  to 

h(a)  >0,  a  >  0.  (3.8) 

In  determining  ak  we  will  use  the  following  quantity: 

o?k  =f  min{a  >  0  :  h(a )  =  0}.  (3.9) 

Recall  that  ak  is  defined  in  Step  3  of  the  general  algorithm  (see  Section  2). 

Lemma  3.1  The  quantity  oPk  is  well  defined  and  oTk  £  (0,0*,).  Moreover,  condition  (3.3) 
is  satisfied  for  all  a  £  (0,cq!!]. 
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Proof:  Let  us  examine  the  function  h(a).  It  follows  from  the  definitions  of  7fc  and  ak 
that 


m  =  ff  -  iur  >  o 

and 

H&k)  =  /rn(«o  -  7 kfr(&k)  =  -7 kfr&k)  <  o. 

Hence  it  follows  from  the  continuity  of  h(a)  that  h(a)  has  a  root  in  [0 ,ak).  When 
MO)  >  0,  h{a)  obviously  has  a  root  in  (0,0:^).  When  /i(0)  =  0,  it  can  be  verified  that 
the  right-derivative  of  h(a)  at  a  =  0  is 

/i'(o+)  -  -(fr-^fr)+ik(i-ak)fr 

=  i(i-7*w  +  (7 k-f?n/fr)}fr 

=  (1  -  7 k)ak  >  0. 

Therefore,  h(a )  >  0  for  sufficiently  small  but  positive  a.  Consequently,  al  >  0. 

Since  h(ak)  <  0,  we  have  al  <  ak.  It  is  evident  that  h(a )  >  0  for  a  6  (0,q^],  i.e., 
condition  3.3  is  satisfied.  This  completes  the  proof.  □ 

An  equivalent  expression  for  al  is 

al  =  minja  >  0  :  f'k(a)  -  7fc/f s(a)  =  0,  *  =  1, 2, . . . ,  n}.  (3.10) 


The  computation  of  al  involves  calculating  the  roots  of  at  most  n  quadratics  and  therefore 
requires  0(n )  operations. 

In  addition  to  a  lower  bound  for  {f'k(ak)/ fkve(ak)}  (i.e.,  condition  (3.3)),  we  also 
impose  an  upper  bound  on  these  quantities;  namely,  we  require  ak  to  satisfy 


where 


ITMcr 

fr(°)  ~  ‘ 


a  >  0, 


(3.11) 


n  e  l/rv/r.  n,  i  <  nr/ir  <  r  < »  and  r,rt  >  i.  (3.12) 

Since  /)t(a)//fcve(a)  <  n  f°r  all  h  condition  (3.11)  will  be  redundant  if  Tfc  =  n.  Our 
reason  for  introducing  condition  (3.11)  is  to  improve  our  complexity  bound.  We  do  not 
feel  that  enforcing  this  condition  will  have  much  practical  significance. 
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Following  the  treatment  of  condition  (3.3),  we  introduce  the  piecewise  quadratic  func¬ 
tion 

H(a)  ■M  fr*(a)  -  rt/r(a).  (3.13) 

It  follows  that  condition  (3.11)  is  equivalent  to 

H(a)  <0,  a  >  0.  (3.14) 

We  will  also  use  the  following  quantity  in  determining  ak: 

ark  =  min{a  >  0  :  H(a)  =  0}.  (3.15) 

Analogous  to  Lemma  3.1  for  condition  (3.3),  we  have  the  following  lemma  for  condi¬ 
tion  (3.11). 

Lemma  3.2  The  quantity  a £  is  well-defined  and  6  (0,(7*,).  Moreover,  condition 
(3.11)  is  satisfied  by  all  a  €  (0,a£]. 

Proof:  The  proof  is  similar  to  that  for  Lemma  3.1,  so  we  omit  it.  □ 

Analogous  to  the  expression  (3.10)  for  condition  (3.3),  we  have  for  condition  (3.11) 

ark  =  min{a  >  0  :  f'k{a )  -  Tkf^ !(a)  =  0,  i  =  1, 2, . . . ,  n).  (3.16) 

For  the  sake  of  simplicity,  we  will  enforce  the  conditions 

7 k  <  1/2  and  Fk  >  2.  (3.17) 

The  specific  values  in  (3.17)  do  not  constitute  a  loss  of  generality  because  they  will  only 
affect  expressions  for  some  constants  in  our  analysis.  These  values  of  7^  and  will  result 
in  much  simplified  expressions  for  those  constants. 

From  (2.2),  we  see  that  for  fixed  Gk  a  larger  step  length  o.k  will  produce  a  larger 
reduction  in  the  duality  gap.  So  it  is  always  desirable  to  take  the  largest  step-length 
possible  as  long  as  other  requirements  are  satisfied.  Our  procedure  for  determining  the 
step-length  ak  is  summarized  as  follows. 
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Procedure  1  (Step-length  Criterion) 

Given  positive  constants  7  and  T  such  that 


0  <  7  <  min(l/2,  fr/fD,  max(2,  fiT/fD  <  T  <  n.  (3.18) 

Step  1  Choose  7fc  G  [7,  min(l/2,  ffa/fD]  Yk  G  [max(2,  fT'/fD,  H- 
Step  2  Compute  cCk  =  min{«  >  0  :  /£(a)  -  7*/re(a)  =  0,  *  =  1,2, . . .  ,n}  fi.e.,  (S.P);. 
Step  3  Compute  ark  =  min{o  >  0  :  f'k(a)-Tkfr(a)  =  0,  i  =  1, 2, . . . ,  n}  (i.e.,  (3.15)). 
Step  4  Let  ak  —  min(a^,a£). 

We  notice  that  the  above  procedure  of  choosing  the  step-length  bears  a  certain  simi¬ 
larity  to  a  procedure  recently  proposed  by  Mizuno,  Todd  and  Ye  [8], 

Now  we  prove  two  technical  lemmas  that  will  be  needed  in  the  later  development. 

Lemma  3.3  For  a  G  [0, 1], 

/fn(a)  >  /rn  -  (/fn  -  ^/r>  +  mm(sk)a\ 

/r»  <  fir  -(/rx-^/r>+max(^)a2. 

Proof:  We  first  look  at  the  linear  part  of  fk(a).  Since  for  all  i, 

' 

fi  -  (ti  -  n/r)*  =  n’  “ =  °’ 

1  nfr,  0  =  1, 

it  is  evident  that  for  a  G  [0, 1] 

/fn  -  (ftr  -  ^/r>  <n-  (fi  -  ^/r>  <  /rx  -  (/rx  - 

For  the  quadratic  terms,  we  clearly  have 

min(sfc)a2  <  s'ka2  <  max(sfc)a2. 
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By  adding  the  quadratic  terms  to  their  corresponding  linear  parts,  we  finish  the  proof. 

□ 

It  is  worth  noting  that  eTsk  =  0  by  (2.5).  Hence,  min(sk)  <  0  and  max(sfc)  >  0.  In 
the  sequel,  we  will  adopt  the  convention  that  £  =  +oo. 


Lemma  3.4  Let  ak  be  given  by  Procedure  1.  Then 


Moreover, 


ak  >  min 


yi-TtK/rtn-iK/r^ 

’  —  min(.Sfc)  ’  max(sjt)  j 


ak  >  min 


(3.19) 


(3.20) 


Proof:  From  (3.9),  cPk  is  a  positive  root  of  f'k{a)  -  ykf%ve(a )  for  some  index  i.  Noticing 
that  for  a  €  [0, 1]  fkve(a)  is  positive,  and  using  Lemma  3.3,  for  a  £  [0, 1],  yk  >  0  and  for 
all  index  z,  we  have 


/!(<»)  -  7 */r(«)  >  fr  -  (/r  -  -  7 kfrw 

=  iff"  -  7t/r)(l  -«)  +  (!-  7 iWr«  +  min(st)a2  (3-21) 

>  (1  -  lk)0kfr*a  +  min(st)a2. 


If  min(5,t)  —  0,  then  h(o)  >  0  for  a  7  (0, 1],  Therefore,  we  will  have  oPk  >  1.  Now  assume 
min(sfc)  <  0.  Then  the  quadratic  on  the  right-hand  side  of  the  last  inequality  in  (3.21) 
has  a  unique  positive  root 

(i  -  7 kfafr 


Otk  = 


—  min(sjt) 

Hence,  if  cPk  <  1,  from  (3.21)  we  must  have  cPk  >  ak.  This  proves  that 


cPk  >  min  1, 


(1  ~  7 k)crkfk' 

-min(s*) 


Similarly,  we  can  prove  that 

•  (,  (r*  —  1  )*kfr\ 

V  max(si)  J 

Combining  (3.22)  and  (3.23),  we  obtain  (3.19). 


(3.22) 


(3.23) 
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Finally,  (3.20)  follows  from  the  facts  that  H^Hoo  =  max{- min(s*),  max(5t)}  and 


<  i  -ik  <  i  <rfc-i. 


This  completes  the  proof. 


□ 


4  Choosing  the  Centering  Parameter 

We  will  use  the  following  notation: 

pk  =  Xkl  Axk  ,  qk  =Yk~1Ayk, 
p?  =  Xk1Ax?,  q?  =  Yk~1Ay 
Pk~Xk1Axk,  qk—Yk1Ayk , 

and 

Uk  =  \(ph  )*(?f)i,  i(Pfc)‘(?r)i,  io£)W)i). 

Lemma  4.1  Ifffn/fkve  >  1,  then 


(4.1) 


(4.2) 


“>k  <  n/ 72. 

Proof:  Multiply  both  sides  of  (2.3)  by  (XkYk)~2  and  consider  the  square  of  the  .£2 -norm 
of  both  sides.  Using  (2.5)  and  (4.1),  we  obtain 

\\(XkYk)hCk\\l  +  \\(XkYk)lqf\\l  = 

or  equivalently  after  dividing  both  sides  by  -xkyk, 

II  r;hf  II?  +  l|r;bf  III  =  eT7ie,  (4.3) 

where  Ty  =  i:  U)-1  is  a  diagonal  matrix.  Our  assumption  implies  that  the 

maximum  diagonal  element  of  { Tk }  is  bounded  above  by  I/7  and  the  minimum  diagonal 
element  of  { Tk  1 }  is  bounded  below  by  7.  Therefore,  from  (4.3)  we  have 

\(Pk)'\  <  Vnh  and  !(«*  )‘l  <  Vn/l- 
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Using  the  same  technique,  we  can  prove  that 

\(PkY\  <  \fn/l  <  Vn/l  and  l(?f  )'|  <  \AV7  <  y/n/ 7. 

From  the  definition  of  uk  and  the  above  estimates,  Lemma  4.1  follows  immediately.  □ 
We  now  state  our  procedure  for  choosing  the  centering  parameter  ak. 

Procedure  2  (Centering  Parameter  Criterion) 

Given 

<7  €(0,1),  =  PU>Yr"  (4'4) 

Step  1  Compute  u>k  from  (f.2). 

Step  2  Compute  p\  =  min(/U,  a/uk). 

Step  3  Choose  pk  €  [{pl  +  Pt)/2,p^]. 

Step  4  Let  ak  =  PkUk. 

Since  ak  =  pkuk  and  pk  €  [pl,pk],  we  have  ak  6  \pluk,  plu>k\.  In  addition,  we  require 
that  ak  be  greater  than  the  midpoint  of  the  interval.  This  requirement  is  needed  in  our 
proof  of  superlinear  convergence.  It  is  evident  that  ak  is  bounded  away  from  one  because 
&k  &  <1-  The  reasons  why  the  centering  parameter  is  so  chosen  will  hopefully  become 
clear  as  our  discussion  proceeds. 

5  Algorithm  Description 

Now  we  formally  state  our  primal-dual  interior-point  algorithm. 

Algorithm  2  Suppose  given  a  strictly  feasible  pair  ( Xo,yo ).  Choose  positive  constants  7 
and  T  such  that  (see  (3.18)) 

0  <  7  <  min(l/2, max(2 ,/Tx//D  <  T  <  n, 
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and  choose  a  <E  (0, 1).  Set  pl  =  72cr/2n  and  pu  >  j2a/n  (see  (44)).  For  k  =  0, 1,2, . . 
do 

Step  1  Compute  the  Newton  step  and  the  centering  step  from  Algorithm  1. 

Step  2  Choose  ak  by  Procedure  2  and  form  (Axk,  Ayk)  from  Algorithm  1. 

Step  3  Choose  a ;k  by  Procedure  1. 

Step  4  Form  (xk+i,yk+i)  from  Algorithm  1. 

The  procedure  for  determining  the  step-length  ak  can  be  implemented  in  an  effective 
manner.  Its  cost  is  somewhat  higher  than  the  ratio  test  that  is  used  in  most  of  the 
practical  implementations.  On  the  other  hand,  our  procedure  for  choosing  the  centering 
parameter  crk  requires  extra  work  when  compared  to  the  more  standard  method.  The 
standard  practice  is  to  choose  the  centering  parameter  prior  to  computing  the  steps, 
then  one  only  needs  to  solve  once  for  the  combined  step  (Newton  step  plus  the  centering 
parameter  times  the  centering  step).  Since  Algorithm  2  requires  the  information  obtained 
from  the  Newton  step  and  the  centering  step  to  choose  the  centering  parameter,  it  requires 
one  to  solve  for  the  two  steps  separately  and  then  combine  them. 

6  Global  Linear  Convergence 

Theorem  6.1  (Global  Linear  Convergence) 

Let  {(xk,yk)}  be  generated  by  Algorithm  2.  Then 

xk+iVk+i  <  (1  -  h/n)xlyk, 

for  some  6  satisfying 

c  >  v{l-v)l2 

16T 
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Proof:  We  need  to  estimate  ||-Syt||oo  in  (3.20).  Let  the  index  j  be  such  that  ||^||oo  =  |s*|. 
Observe  that 

\Moo  =  \AxkAyk\  =  \(xipi)(y:kqi)\  =  \{xJkyl)(piq3k)\ 

<  ma.x(XkYke)\\diag(pk)qk\\00 

=  frX\\diag(pZ  +  akpCk){q"  +  akpck)  |U 

<  fTM  i  +  O2 

<  4/rw 


Hence,  it  follows  from  (3.11),  (3.20)  and  Procedure  2  that 


ak  >  min  1, 


Pkfl 


8/j 


max 

k 


>  min 


(6.1) 


Substituting  pl  (see  (4.4))  into  the  above  expression,  we  obtain 

.,2 


oik  > 


<77 

16rn 


The  proof  is  completed  by  substituting  the  above  inequality  into  (2.6)  and  noticing  that 
crk  <  cr.  □ 

The  following  corollary  follows  immediately  from  Theorem  6.1.  By  a  standard  argu¬ 
ment,  it  leads  to  polynomiality  assuming  integral  data. 


Corollary  6.1  Assume  that  a  strictly  feasible  pair  ( x0,y0 ),  constants  7  and  T,  both  in¬ 
dependent  ofn,  are  chosen  such  that  (3.18)  is  satisfied  and  xfi,y0  <  2uh ,  where  L  >  0  and 
v  is  a  positive  constant  independent  ofn.  Then  in  at  most  0(nL )  iterations,  Algorithm  2 
will  produce  ( Xk,yk )  such  that  xfyk  <  2~L. 

Proof:  From  Theorem  6.1, 

x^yk  <  (1  -  8/n)kxly0  <  (1  -  8/n)k2l'L. 

Let  (1  —  8/n)k 2vL  =  2~h  and  take  the  natural  logarithm  of  both  sides.  We  have  k  = 
—  (In2)(l  +  v)L/  ln(l  —  8/n).  Observe  that  for  x  £  (0, 1) 

00  k 

ln(l  -  *)  =  -  E  ~r  <  ~x- 

k- 1  K 
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Therefore, 


k  <  (ln2)(l  +  v)L/(6/n)  =  0(nL). 


This  completes  the  proof.  □ 

7  Quadratic  Convergence 

In  this  section,  we  will  apply  Theorem  2.1  to  establish  that  under  strict  complementarity 
and  nondegeneracy  assumptions  our  algorithm  converges  Q-quadratically.  It  can  be 
shown  that  the  nondegeneracy  and  strict  complementarity  assumptions  at  optimality 
imply  the  uniqueness  of  both  primal  and  dual  solutions.  We  have  already  established 
convergence  of  the  duality  gap  sequence  to  zero  in  the  preceding  section.  With  the 
uniqueness,  it  can  be  shown  that  the  convergence  of  the  duality  gap  implies  that  of  the 
iterates  to  the  unique  solution  (x „,?/*)  >  0.  What  we  must  verify  is  assumption  (iii)  of 
Theorem  2.1;  namely, 

ak  =  0{xTkyk)  and  rk  -  1  -  0{xlyk). 

Since  rk  =  ak/ak,  for  the  latter  it  suffices  to  show  that 

6tk  -*  1  and  ak  -  ak  =  0(xkyk).  (7.1) 

The  following  lemma  will  be  useful.  It  is  a  slightly  modified  version  of  Lemma  3.2  in 
[11].  We  refer  interested  readers  to  the  original  paper  for  its  proof. 

Lemma  7.1  (Zhang,  Tapia  and  Dennis) 

Let  (x*,t/*)  be  a  solution  of  problem  (1.3)  and  {(^fc,J/(t)}  be  generated  by  Algorithm  2. 
LetPkiPdiqk  and  be  defined  by  (4-1)-  Assume 

(i)  strict  complementarity  holds  at  (x*,?/*), 

(ii)  x*  is  a  nondegenerate  vertex  of  (1.1), 
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Then 


and 


(  n  \ 


Pk  = 


\  -1  / 


+  0(xlyk),  pck  = 


0 

xlvk/n 


jffykfn 
\  [XkYke\«  / 


+  0{xTkyk ), 


<lk  = 


I  -1  ^ 

/  ^Ivk/n  \ 

[XkYkeP 

-1 

xlvk/n 

+  0(xlyk),  qck  = 

[XkYke]™ 

0 

0 

\  0  J 

l  0  y 

+  0(xTky  k), 


where  the  number  of  zeros  is  m  in  p%  and  p% ,  and  n  -  m  in  qk  and  q% . 

Now  we  are  ready  to  state  and  prove  our  quadratic  convergence  theorem. 


Theorem  7.1  (Quadratic  Convergence) 

Let  (x+,y„)  be  a  solution  of  problem  (1.3)  and  {{xk,yk)}  be  generated  by  Algorithm  2. 
Assume 


(i)  strict  complementarity  holds  at  (a:*,  ?/*)_, 

(ii)  x„  is  a  nondegenerate  vertex  of  (1.1), 

(iii)  pu  is  sufficiently  large,  e.g.,  pu  >  16r. 

Then  {(xk,yk)}  converges  to  (x*,y*)  Q-quadratically. 

Proof:  We  first  prove  ak  =  0(xfyk).  Observe  from  Lemma  7.1  that  for  each  index  i 
either  the  “p”  terms  ((pf)1'  and  (p£)')  or  the  “q”  terms  and  (9f)*')  are  0(xjyk) 
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while  the  other  terms  are  bounded.  Thus,  the  quantity  uk  (see  its  definition  (4.2))  is 
0(xkyk).  So  is  ak  because  ak  <  puujk. 

Since  uk  — >  0,  from  the  choice  of  pk  in  Step  2  of  Procedure  2  we  have  for  k  sufficiently 
large 

puk  =  pu  and  pk  >  ^( pl  +  pu).  (7.2) 

We  observe  that  if  pu  is  sufficiently  large,  e.g.,  pu  >  16r,  (i.e.,  ak  is  not  forced  to  approach 
zero  too  quickly),  then  the  step-length  ak  will  eventually  be  equal  to  or  greater  than  one, 
as  can  be  seen  from  (6.1). 

Since  ak  =  0{xkyk)  and  (xjyk/n)/  min(Wt54e)  is  bounded,  the  elements  of  pk  and 
qk  are  either  0(xjyk)  or  —1  +  0(x^yk).  Therefore, 

mm(X^  Axk,  Y^Ayk)  =  min (pk,  qk)  =  -1  +  0(xlyk).  (7.3) 

By  examining  the  definition  of  ak  in  Step  (3)  of  Algorithm  1,  we  see  ak  =  1  +  0(xjyk). 
Consequently,  for  k  sufficiently  large  we  have 

1  <  Oik  <  =  1  +  0(xlyk). 

This  implies  (7.1)  and  completes  the  proof.  □ 

8  Superlinear  Convergence 

In  this  section,  we  will  apply  Theorem  2.2  to  establish  Q-superlinear  convergence  of 
Algorithm  2  for  general  problems.  We  must  show  that  assumption  (iii)  of  Theorem  2.2 
holds;  i.e., 

crk  — >  0  and  rk  — »  1. 

For  the  latter,  it  will  suffice  to  show  ak  — >  1  and  ak  —  ak  — >  0.  Without  the  nondegeneracy 
assumption,  we  can  no  longer  use  Lemma  7.1.  For  technical  reasons,  we  must  restrict 
further  the  choice  of  pk. 


22 


Denote  the  length  of  the  interval  \p\pk\  by  wk.  It  follows  from  (4.4),  Step  2  of 
Procedure  2,  and  Lemma  4.1  that 

o,2<j 

Pi  >  —  •  (8.1) 

n  v  ' 

Thus, 

*kd={pt-pl>^>0.  (8.2) 

Let  Efc  be  the  following  set  of  2 n  points 

s*  =  {-(pj')‘/(pf)i,-(«fr)i/(«f)i.  i  =  l,2,...,n} 

and  define  the  distance  from  cr  to  the  set  Sj,  as 

dist(<r,  Si)  =  min{[<7  -  c|  :  C  6  S*}. 

We  choose  ak  according  to  Procedure  2  with  the  additional  restriction  that 

dist(<rfc,  Sfc)  >  Trku)k/(8n  +  4).  (8.3) 

In  other  words,  we  require  not  only 

crk  €  [0.5(/  +  pl)u>k,  pluk]  (8.4) 

but  also  that  ak  be  bounded  away  from  the  set  Si  by  at  least  TTkuk/(8n  +  4).  Since  {7Ti} 
is  bounded  away  from  zero,  we  see  from  (8.3)  that  {dist(eri,  Si)}  is  bounded  away  from 
zero  if  {cui}  is  bounded  away  from  zero. 

The  purpose  of  introducing  condition  (8.3)  is  to  avoid  the  situation  where  pk  = 

( PkY  +  ak(pky  (say)  converges  to  zero  but  (pj* )*'  and  (pj?)1'  do  not.  Although  we  believe 
that  this  situation  is  extremely  unlikely  to  happen,  we  have  not  been  able  to  rule  it  out. 

Lemma  8.1  The  set  of  ak’s  satisfying  (8.3)  and  (8.f)  is  nonempty. 

Proof:  The  length  of  the  interval  in  (8.4)  is  KkLuk/2.  Partition  this  interval  into  2n  +  1 
equal  sub-intervals,  each  having  length  wkuk/(4: n  -f  2).  If  the  interior  of  any  one  of  the 
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sub-intervals  does  not  intersect  Yhk,  then  the  midpoint  of  that  sub-interval  will  satisfy 
(8.3)  and  (8.4).  Since  T,k  has  only  2 n  points,  it  cannot  intersect  the  interiors  of  all  the 
2n  +  1  sub-intervals.  This  proves  the  lemma.  □ 

Now  we  are  well-equipped  to  prove  our  superlinear  convergence  theorem. 

Theorem  8.1  (Superlinear  Convergence) 

Let  (x»,y,)  be  a  solution  of  problem  (1.3)  and  { (a:*,,  2/fc) }  be  generated  by  Algorithm  2 
with  the  restriction  (8.3)  on  the  centering  parameter  a k.  Assume 

(i)  strict  complementarity  holds  at  (x,,t/,), 

(ii)  pu  is  sufficiently  large,  e.g.,  pu  >  16T. 

U  K3'*:)  2/fc)}  converges  to  {xm,yf),  then  the  duality  gap  sequence  {xfyk}  converges  to  zero 
Q-superlinearly. 


Proof:  We  first  prove  ak  — >  0.  It  suffices  to  show  uk  — ►  0. 
Let  x\  >  0.  Obviously, 

1  =  lim  =  lim  (1  +  akp'k ). 

k-*o o  X'  ~  FkJ 


k—K> o 


This  implies  pk  — >  0,  because  { ak }  is  bounded  away  from  zero.  On  the  other  hand,  if 
x\  =  0,  then  y\  >  0  by  strict  complementarity.  The  same  argument,  interchanging  the 
roles  of  p\  and  qk,  gives  qk  — >  0.  Therefore,  for  each  index  i,  either 


A  =  ( Pk)'  +  etiPh)'  ^0  or  =  (?”)'  +  <n,(qk)'  -*  0. 


(8.5) 


We  will  prove  uk  — >  0  by  contradiction.  Suppose  the  opposite.  Then,  there  exists  a 
subsequence  {tUfc0}  C  that  is  bounded  away  from  zero.  This  in  turn  implies,  from 

(8.3),  that  {dist(<7fc0,  £fco)}  is  bounded  away  from  zero  (recall  that  Kk  is  bounded  away 
from  zero). 

We  have  shown  that  for  each  index  i,  either  p[  — *•  0  or  qj.  —>  0.  Without  loss  of 
generality,  assume  p\  — >  0.  We  now  show  that  {(p£0)'}  converges  to  zero.  Otherwise, 
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there  exists  a  subsequence  {(/>£)'}  C  {{pkoY}  such  that  {|(p£)‘|}  is  bounded  away  from 
zero.  For  this  subsequence, 

pi,  =  «)‘ + =  (p£)‘k  +  (pJFM,)’]  -  o. 

This  implies 

^  +  (PkiY/ipZY  -►  °- 

However,  this  cannot  be  true  because  {dist((jfc, ,  E*., )}  C  {dist(cr*.0,  Efco)}  is  bounded  away 
from  zero.  Hence,  (p^Q )'  — >  0. 

Now  in  view  of  (8.5)  we  also  have  (pj^)*  — *  0.  Similarly,  we  can  prove  that  if  q{  —*  0, 
then  we  have  both  (qjY0Y  —*  0  and  (qkoY  — »  0.  Therefore,  for  each  index  i,  either  (pj^)1 
and  (pk0)\  or  ( q[ Jo)'  and  (q%Q )'  converge  to  zero.  Since  all  these  sequences  are  uniformly 
bounded  (see  the  proof  of  Lemma  4.1),  this  leads  to  u>ko  — >  0  (see  definition  (4.2)), 
contradicting  the  hypothesis  that  {u^}  is  bounded  away  from  zero.  This  proves  that 
Uk  — *  0.  Consequently  ak  — >  0. 

Now  we  prove  ak  —>  1.  Note  that  (2.4)  can  be  written  as 

Pk  +  qk  =  ~e  +  ak\xlyk{XkYk)~l  t. 

Since  ^xkyk(XkYk)~1e  is  bounded  above  by  I/7,  as  ak  — >  0,  we  have 

Pk  +  qk  -*  -e. 

We  have  shown  that  for  each  i,  either  p\  ->  0  or  q\  — ►  0.  Therefore,  all  p\  and  q'k  converge 
to  either  0  or  -1.  This  again  implies  that  ak  — >  1  (see  (7.3)).  In  view  of  (6.1)  and  (7.2), 
ak  will  eventually  be  equal  to  or  greater  than  one  if  pu  is  sufficiently  large,  e.g.,  p“  >  16T. 
Hence, 

1  <  <  «Jt  ->  I- 

This  completes  the  proof.  □ 
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9  Concluding  Remarks 


In  this  paper,  we  have  shown  that  the  two  fundamental  parameters  in  primal-dual 
interior-point  algorithms  for  linear  programming  can  be  chosen  in  such  a  way  that  both 
polynomiality  and  superlinear  convergence  are  achieved.  If  the  solution  is  a  nondegener¬ 
ate  vertex,  then  in  addition  to  superlinear  convergence  we  have  quadratic  convergence. 

The  current  practices  in  some  of  the  state-of-the-art  implementations  of  primal-dual 
interior-point  algorithms  have  the  following  common  fundamental  features.  First,  they 
allow  iterates  to  be  very  close  to  the  boundary  of  the  positive  orthant;  second,  they 
phase  out  the  centering  steps  at  a  fast  pace.  The  theory  established  in  Zhang,  Tapia 
and  Dennis  [11]  has  already  provided  theoretical  justification  for  such  a  practice  from 
the  viewpoint  of  fast  convergence.  This  paper  provides  further  theoretical  justification 
for  such  a  practice  from  the  viewpoint  of  polynomiality.  In  summary,  one  can  indeed, 
under  reasonable  conditions,  accomplish  both  objectives  —  good  global  behavior  and 
good  local  behavior. 

We  recently  learned  of  a  new  result  by  Guler  and  Ye  [4].  When  applied  to  linear 
programming,  it  says  that  condition  (3.1)  will  guarantee  strict  complementarity  for  any 
limit  point  of  the  iteration  sequence  generated  by  an  interior-point  algorithm.  This 
result  nicely  complements  the  Zhang-Tapia- Dennis  theory  (i.e.,  Theorems  2.1  and  2.2) 
and  therefore,  the  strict  complementarity  assumptions  in  Theorems  7.1  and  8.1  are  no 
longer  necessary. 
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